Skip to content

rfc: temporal axis for contradiction probe#1007

Closed
garrytan wants to merge 1 commit into
masterfrom
wintermute/temporal-contradiction-probe
Closed

rfc: temporal axis for contradiction probe#1007
garrytan wants to merge 1 commit into
masterfrom
wintermute/temporal-contradiction-probe

Conversation

@garrytan
Copy link
Copy Markdown
Owner

@garrytan garrytan commented May 15, 2026

What

A report from resolving ~115 HIGH contradiction findings in a production gbrain run, plus a proposal for a 4-phase fix. Documents why a majority of residual findings are temporal false positives and proposes a time-axis enhancement to the contradiction probe.

Why

The probe treats all claims as timeless. A status change recorded as "trial" in April and "confirmed" in May gets flagged as a contradiction. A role transition recorded in 2017 vs. an updated role in 2025 gets flagged. These are features of a brain that records history, not bugs.

Use Cases (synthetic placeholders in the RFC text)

  1. Founder tracking — claimed vs. realized metrics across office hours. Detect regressions, score prediction accuracy, surface narrative drift.
  2. Event disambiguation — two distinct events within a short window. Two events, not one contradiction.
  3. Role/status changes — people change roles. Historical records aren't wrong; they're history.
  4. Decision evolution — multi-step decisions where each step supersedes the previous.

Proposed Phases

  1. Judge prompt enhancement (smallest change, biggest impact — pass dates to judge, add temporal_supersession verdict)
  2. Claim trajectory view (e.g. gbrain eval trajectory "Acme Corp MRR")
  3. Automatic valid_from/valid_until population during extract_facts
  4. Founder scorecard

Stats

  • ~107K pages, ~257K chunks
  • ~115 HIGH findings → ~0 genuine contradictions remaining after manual resolution
  • A majority of original findings were temporal false positives

This is a report/RFC, not a code change. Phase 1 is the recommended starting point.

Field report on residual HIGH findings from gbrain eval suspected-contradictions
and proposal for a 4-phase fix (Phase 1 = judge prompt + verdict enum is the
recommended starting point).

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
@garrytan garrytan force-pushed the wintermute/temporal-contradiction-probe branch from 764f4a7 to ef5a350 Compare May 16, 2026 00:12
@garrytan garrytan changed the title rfc: temporal axis for contradiction probe — field report from Wintermute rfc: temporal axis for contradiction probe May 16, 2026
@garrytan
Copy link
Copy Markdown
Owner Author

Closing — branch renamed to drop a private agent name from the head ref per privacy rule. Same commit, same content, reopened as a fresh PR.

@garrytan garrytan closed this May 16, 2026
@garrytan garrytan deleted the wintermute/temporal-contradiction-probe branch May 16, 2026 00:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants